NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning Trust Over Directed Graphs in Multiagent Systems

Akgun, Orhan Eren; Dayı, Arif Kerem; Gil, Stephanie; Nedic, Angelia (June 2023, Proceedings of Machine Learning Research)
Matni, N.; Morari, M.; Pappas, G. J. (Ed.)
We address the problem of learning the legitimacy of other agents in a multiagent network when an unknown subset is comprised of malicious actors. We specifically derive results for the case of directed graphs and where stochastic side information, or observations of trust, is available. We refer to this as “learning trust” since agents must identify which neighbors in the network are reliable, and we derive a learning protocol to achieve this. We also provide analytical results showing that under this protocol i) agents can learn the legitimacy of all other agents almost surely, and ii) the opinions of the agents converge in mean to the true legitimacy of all other agents in the network. Lastly, we provide numerical studies showing that our convergence results hold for various network topologies and variations in the number of malicious agents.
more » « less
Full Text Available
A Reinforcement Learning Look at Risk-Sensitive Linear Quadratic Gaussian Control

Cui, Leilei; Basar, Tamer; Jiang, Zhong-Ping (June 2023, Proceedings of Machine Learning Research)
Matni, N; Morari, M; Pappas, G J (Ed.)
In this paper, we propose a robust reinforcement learning method for a class of linear discrete-time systems to handle model mismatches that may be induced by sim-to-real gap. Under the formulation of risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to iteratively approximate the robust and optimal controller. The convergence and robustness of the dual-loop policy optimization algorithm are rigorously analyzed. It is shown that the dual-loop policy optimization algorithm uniformly converges to the optimal solution. In addition, by invoking the concept of small-disturbance input-to-state stability, it is guaranteed that the dual-loop policy optimization algorithm still converges to a neighborhood of the optimal solution when the algorithm is subject to a sufficiently small disturbance at each step. When the system matrices are unknown, a learning-based off-policy policy optimization algorithm is proposed for the same class of linear systems with additive Gaussian noise. The numerical simulation is implemented to demonstrate the efficacy of the proposed algorithm.
more » « less
Full Text Available
Learning without Knowing: Unobserved Context in Continuous Transfer Reinforcement Learning

Liu, C.; Zhang, Y.; Shen, Y.; Zavlanos, M. M. (January 2021, Proceedings of Machine Learning Research)
Jadbabaie, A.; Lygeros, J.; Pappas, G. J.; Parrilo, P. A.; Recht, B.; Tomlin, C. J.; Zeilinger, M. N. (Ed.)
Full Text Available

Search for: All records